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5 filed February 2, 2000, now U.S. Patent No. 6,542,954, the contents of which are 
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BACKGROUND OF THE INVENTION 
Field of the Invention: 

10 The present invention relates to an electronic device including a computer 

system incorporating a disk subsystem, a disk array, or a disk drive. The present 
invention also relates to a technology, which allows high-speed transfer by means of 
arrayed disks connected by a fabric switch. 



15 Description of the Prior Art: 

In general, the connection between a disc controller device and a plurality of 
disk drives in a disk array may be achieved, as disclosed in the JP-A No. 10-171746, 
by an SCSI interface or by a fibre channel arbitrated loop topology. 

The SCSI interface, which uses a time-divided data transfer method on one 
20 signal line, negotiates with its initiator one to one for one moment on one signal line 
for an access. 

The fibre channel arbitrated loop topology, on the other hand, may connect 
the initiator and disk drives In a loop by means of a serial interface, to enable time- 
division transfer of the data divided into frames to allow a number of communications 
25 with a plurality of devices at the same time and to allow up to 126 disk drive devices 
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to be connected. 

Disk drives will become more and more compact and higher density 
implementation thus will ultimately realize the use of more disk drive devices. Ideally 
the connection of a disk drive with its interface one to one should be implemented to 

5 enable a maximum transfer rate. 

An SCSI interface in the Prior Art adopts a one-to-one data transfer scheme 
for one moment in one signal line. This may be a drawback if one wishes to 
implement simultaneous communications between an initiator and a plurality of disk 
drives. The number of connectable disk drive units in one bus is also limited to 7 or 

10 1 5. When one connects a number of drive units for one-to-one negotiation on the 
SCSI interface, a plurality of interfaces are required, causing difficulty in mounting. 
Because the number of the connectable disk drive units in one controller Is so 
limited, one may encounter the necessity to add some further controller interfaces for 
connecting all units to a system. 

15 When using a fibre channel, a plurality of disk drive units may be connected to 

a single controller. In a case where the controller and disk drive units may be 
connected by implementing a fibre channel fabric switch for switching the 
connectivity, a substantial one-to-one connection between the controller and the disk 
drive units may be implemented. Although the controller may support a fabric 

20 protocol for switching connection, generic disk drive units support only a fibre 
channel arbitrated loop (FC-AL) protocol but not the fabric protocol. 

This resulted in that the switching connection may not be implemented, so 
that a loop connection had to be used with the FC-AL for sharing a same loop 
between plural disk drive units. 

25 More specifically, a device (a controlling device in this context) which supports 
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the fabric protocol has a World Wide Name (WWN), as its unique 24-bit address. 

The device may be logged in to the fibre channel fabric switch by using this unique 

address. A device (a disk drive unit) which supports only the FC-AL protocol but not 

the fabric protocol uses significant 16 bits in a same 24 bit address for verifying 
5 location within the loop and least 8 bits for the address AL-PA (Arbitrated Loop Port 

Address: each disk unit have a unique value in the loop) for logging in to a device (a 

controlling device managing the loop). 

In such a loop connection, if the number of disk drive units connected in the 

same loop is increased, a data transfer rate of disk drive units may ultimately exceed 
10 beyond a maximum data transfer rate of the loop, resulting in that the data transfer in 

this loop may be limited to efficiency of the maximum data transfer rate of the loop. 

The data transfer rate in such a loop will be decreased to that in an equivalent SCSI 

interface. 

15 SUMMARY OF THE INVENTION 

The present invention has been made in view of the above circumstances and 
has an object to overcome the above problems and to provide a switch connection 
having a protocol converter for converting a protocol used between a disk drive unit 
and a controlling device to allow the disk drive unit and the controlling device to be 

20 connected one to one in a switching connection. For a switch having such a protocol 
converter, such as an FL_Port in accordance with an ANSI standard. 

Additional objects and advantages of the invention will be according to part in 
the description which follows and in part will be obvious from the description, or may 
be learned by practice of the invention. The objects and advantages of the invention 

25 may be realized and attained by means of the instrumentalities and combinations 
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particularly pointed out in the appended claims. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The following description of a preferred embodiment of the present invention 
5 may be best understood by reading carefully with reference to the accompanying 
drawings, in which: 

Fig. 1 is an overview of a preferred embodiment in accordance with the 
present invention; 

Fig. 2 is a schematic detailed block diagram of a disk array controller; 
10 Fig. 3 is a schematic detailed block diagram of a fibre channel fabric switch 

controller; 

Fig. 4 is a schematic block diagram of a fibre channel fabric switch; 
Fig. 5 is a schematic block diagram of a fibre channel fabric switch and an 
arbitrated loop; 

15 Fig. 6 is a schematic detailed block diagram of a fibre channel arbitrated loop 

controller; and 

Fig. 7 is a schematic detailed block diagram of a spare disk drive unit 
controller. 



20 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

A detailed description of a preferred embodiment of an extemal storage 
device (a disk subsystem) embodying the present invention will now be given 
referring to the accompanying drawings. Fig. 1 shows an overview of the device. 
In the external storage device shown in the figure, N disk array controllers 
25 (controller section) 1-1 to 1-N (controllers in middle such as 1-2 are not shown, this 
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applies to hereinbelow) are connected to a host computer (not shown) in an upper 
side, and provide M disk drive interface (disl< drive l/F) controllers 2-1 to 2-M in a 
bottom side. The hardware configuration of the disk array controller will be described 
below in greater details. Each of M controllers of fibre channel fabric switch 3-1 to 3- 
5 M are respectively connected to the disk drive interface (l/F) controllers 2-1 to 2-M 
for controlling disk drive units through their fibre channel interface 5. L disk drive 
units are connected to one fibre channel fabric switch controller, a total of M by L 
disk drive units (4(1 ,1 ) to 4(M,L)) are connected to the fibre channel fabric switch 
controllers 3-1 to 3-M through fibre channel interfaces 6. 

10 Each of disk drive interface controllers 2-1 to 2-M and disk drive units 4(1 ,1 ) to 

4(M,L) has its unique identifier (ID number) for a loop protocol respectively. The fibre 
channel fabric switch controllers 3-1 to 3-M receive the ID numbers of the disk drive 
units to be connected from the disk drive interface controllers 2-1 to 2-M, to establish 
one-to-one connection between the corresponding disk drive interface controllers 2-1 

15 to 2-M and the disk drive units 4(1 ,1 ) to 4(M,L). 

Fig. 2 shows a hardware configuration of the disk array controllers 1-1 to 1-N. 
Data transferred thereto from the host computer (not shown) is temporarily stored in 
a cache memory controlled by a host interface controller 7 so as to be added with 
parity data by a parity data generator 9, then to be split into (a total of M segments 

20 of) data blocks and parity data block(s). These data blocks and parity block(s) will be 
stored to a respective disk drive group (not shown) by the disk drive interface 
controllers 2-1 to 2-M, which are corresponding interfaces. 

To transfer data to the host computer (not shown), if there is data to be 
transferred thereto in the cache memory 8, then the data in the cache will be 

25 transferred to the host by the host interface controller 7. If the data to be transferred 
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to the host is not in the cache memory 8 then the disl< drive interface controllers 2-1 
to 2-M will read split data segments out of the disk drive group, concatenate split 
data segment blocks in the parity data generator 9, and store the complete data 
temporarily in the cache memory 8, and the host interface controller 7 will transfer 
5 the data to the host. 

The foregoing embodiment depicts a data storage method in a case of a RAID 
system. However data may also be stored without the RAID system. Without the 
RAID system, parity data generator 9 does not exist. The data transferred from the 
host (not shown) are temporarily stored in the cache memory 8 and then written to 

10 any one of disk drive units in the disk drive group. When mirroring a same data will 
be written into the plural disk drive units. For reading out, the data will be read out of 
the disk drive units, stored temporarily In the cache memory 8 and the host interface 
controller 7 will transfer to the host. 

It should be noted that in the following description, another embodiment of 

15 disk subsystem using the RAID system will be described, however the embodiment 
may equivalently be made without using the RAID system. 

Fig. 3 shows a hardware configuration of the fibre channel fabric switch 
controllers 3-1 to 3-M. A protocol controller 16 (a first protocol controller) connected 
to the disk drive interface controller 2-1 detects ID number of the disk drive units 

20 4(1,1) to 4(1, L) to be accessed and controls a fibre channel protocol used. A protocol 
controller 16' (a second protocol controller) connected to the disk drive units 4(1 ,1 ) to 
4(1 ,L) allocates a new ID number for a fabric protocol of the disk drive units 4(1 ,1 ) to 
4(1 ,L) to the ID number for the loop protocol specific to the disk drive units 4, in order 
to report to a switch controller 1 7 the ID number of disk drive units 4(1 ,1 ) to 4(1 ,L) in 

25 charge- The switch controller 1 7, which maintains the ID numbers for these protocols 
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of the disk drive units 4(1 ,1 ) to 4(1 ,L) by using for example a table, may set the 
switch 18 based on the ID number (fabric protocol) received from the disk drive 
interface controllers 2-1 to 2-M so as to establish the one-to-one connection. 

In other words the switch controller 1 7 sets the corresponding 24-bit WWN 
5 address used in the fabric protocol to each of disk drive units 4 as a new ID number. 
Then the switch controller 1 7 may use a table to attempt to correspond the ID 
number for the loop protocol with the newly set ID number for the loop protocol. By 
corresponding the IDS the disk drive interface controller may establish a connection 
to the disk drive unit 4 by using the newly set ID number. In this specification this 

10 coordination of ID numbers may also be referred to as a protocol control or a 
protocol conversion. 

In addition, the protocol control may be set so as to be performed in the 
protocol controller 16', or may be set so as to switch the protocol controller 16 with 
the protocol controller 16' for the data transfer from the host computer and for the 

15 data transfer to the host, or for the data transfer for a normal operation and for the 
data transfer for an operation in a disk failure. 

Another configuration may also be used in which one of the protocol controller 
16 and the protocol controller 16' is used, in such a case an ID number detector 
means may be provided instead of the protocol controller 1 6, or an ID number 

20 allocating means may be provided instead of the protocol controller 16'. 

Alternatively, a protocol controller and switches may be provided within the 
disk drive interface controllers 2-1 to 2-M to allow direct connection to the disk drive 
units 4(1,1) to 4(1 ,L), instead of proprietary fibre channel fabric switches provided 
independently in the system. 

25 Fig. 4 shows an operation of the fibre channel fabric switch controllers 3-1 to 
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3-M. 

The disk array controller 1-1 stores data split to M segments into a disk drive 
group 10-1. The disk drive interface controllers 2-1 to 2-M in the disk array controller 
1-1 send the ID number of disk drive units belonging to the disk drive group 10-1 to 
5 the fibre channel fabric switch controllers 3-1 to 3-M so as to establish a switching. 
The protocol controller 16 in the fibre channel fabric switch controllers 3-1 to 3-M 
(see Fig. 3) detects the ID number sent to request the switch controller 1 7 to switch 
the switch connection in order to achieve the protocol control pertinent to the disk 
drive units. The switch controller 17 (see Fig. 3) switches a switch 18 (see Fig. 3) so 

10 as to connect the disk array controller 1-1 requesting connection to the requested 
disk drive unit 4 belonging to the disk drive group 10-1 . 

It should be recognized that since the disk array controller 1-1 is 
correspondingly connected to one disk drive group 10-1 through the fibre channel 
fabric switch controllers 3-1 to 3-M another disk an^ay controller 1-N and the disk 

15 drive group 10-2 may separately perform another data transfer without interference. 
When the disk array controller 1-N establishes a connection to the disk drive group 
10-L, the connection between the disk array controller 1-1 and the disk drive group 
10-1 and the connection between the disk array controller 1-N and the disk drive 
group 10-L can operate separately from each other to perform the data transfer at a 

20 maximum data transfer rate possible between each disk array controller and 
respective disk drive unit. 

Although not described in this specification the switch controller 17, when 
switching the connection as have been described above, may effectively maintain 
the maximum transfer window by switching the connection of the switch 18 upon 

25 reception of signals indicating that the disk drive unit connected thereto becomes 
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ready to read/write at a time of data read or data write. 

Fig. 5 shows anotlier extended embodiment in accordance with the present 
invention. In the embodiments above, the protocol controller 16 in a fibre channel 
fabric switch controller 3 was connected one to one to the disl< drive unit 4. In the 
5 present embodiment however, a same section is configured such that the protocol 
controller 16 is connected in loop to the plural disk drive units 4 through a fibre 
channel arbitrated loop controller 1 1 . In this manner, an array of a plurality of 
inexpensive disk drive units 4 may operate at the performance level equivalent to an 
expensive large disk drive unit of the same capacity. In this configuration not all disk 

10 drive units are connected in loop. Apparently the fibre channel arbitrated loop 
controller 1 1 and the plural disk drive units 4 form the single disk drive unit 4, The 
performance of accessing will not be degraded. 

Although not shown in the figure, if the maximum data transfer rate of the fibre 
channel interface is enough higher with respect to the accessing speed of disk drive, 

15 the number of disk drive units 4 may be increased without aggravation of access 
performance, by connecting the plural disk drive units 4 to the fibre channel 
arbitrated loop controller 1 1, by connecting the plural disk drive units in a same loop, 
and by sharing the maximum transfer rate of the fibre channel with the plural disk 
drive units 4. 

20 Fig. 6 shows a hardware configuration of the fibre channel arbitrated loop 

controller 11 used for the embodiment shown In Fig. 5. 

A fibre channel arbitrated loop controller 1 1 comprises a loop bypass circuit 
13, a plurality of disk drive unit attaching ports 12, and a fabric switch connector port 
15. From disk drive units 4 loop bypass circuit switching signal 14 is output, allowing 

25 ports to be bypassed in case of failure, In order to enable hot swapping of disks. 
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More specifically, loops will keep alive, other operating disks will not be affected, and 
the failed disk drive unit can be detached and/or new disk drive units can be added. 

Fig. 7 shows another extended embodiment in accordance with the present 
invention. 

5 The present embodiment comprises spare disk drive unit controllers 19 each 

connected to respective fibre channel fabric switch controllers 3-1 to 3-M, a plurality 
of spare disk drive units 4-a and 4-b each connected to the spare disk drive unit 
controllers 19. The spare disk drive units 4-a and 4-b are provided in common to all 
(disk drive units connected to) switch controller circuits. These spare disk drive units 

10 4-a and 4-b may be either connected in loop to a spare disk drive unit controllers 19, 
or switched. 

Within the fibre channel fabric switch controller 3, the protocol controller 16' 
(see Fig. 3) connected to a disk drive group having a failed disk drive unit 4 (in the 
figure the disk drive unit 4(1 ,2)) may be connected to the spare disk drive unit 
15 controllers 19 through the switch controller 3-1 . 

If a disk drive unit is not operating well, the disk array controller 1-1 to 1-n 
attempts to rebuild the data structure in the spare disk drive unit 4-a or 4-b. When a 
specific disk drive unit 4 has so many operational errors that a failure of disk drive 
unit mechanism is forecasted, the array controller 1-1 picks up and copies data 
20 stored in the malfunctioning disk drive unit 4 to a spare disk drive unit 4-a or 4-b and 
rebuilds the disk array. 

The switch controller of the present embodiment then has an internal 
configuration or layout of a switch 18' slightly different from the switch controllers as 
described in the preceding embodiments so as to enable input from the disk drive 
25 units to be output to the disk drive units 4. For example, another switch 1 8' may be 
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provided between the protocol controller 16' and the switch 18 to determine 
according to the request from the spare disk drive unit controller 18 whether the 
output from the disk drive units is routed to the switch 18 or routed to another 
protocol controller 16'. 

If a disk drive unit fails and the data stored therein cannot be read out, lost 
data may be reconstructed in the cache memory 8 and parity data generator 9 in the 
disk array controller 1 from the data stored in the other disk drive units of the same 
disk drive group as the failed disk drive unit 4 to rebuild the data into the spare disk 
drive unit 4-a or 4-b. 

It should be noted that the switch controller 3 identical to the preceding 
embodiments may be used because the disk array controller 1 may be served for the 
data recovery when the data stored in an erroneous disk drive unit 4 is copied to a 
spare disk drive unit. 

In addition, the spare disk drive unit controllers 19 may be independently 
served for the data recovery from a failed disk drive unit. To do this, cache memory 
and parity data generator should be provided in the spare disk drive unit controllers. 
19. The spare disk drive unit controllers 19 may read out data from the disk drive 
units other than the failed unit in the same group to regenerate the lost data 
segments and store thus generated data in a spare disk drive unit 4-a or 4-b. 

The data recovery without affecting to the data access operation from the host 
computer may be achieved by performing access operation to the spare disk drive 
unit controllers 19 from the failed disk drive unit 4 or from other disk drive units 
storing split data including the parity for the recovery of errors, independently of the 
data access operation between the disk drive units 4 (disk drive group comprising 
disk drive units 4(1 ,1 ) to 4(1 ,L) in the figure) and the host computer via the disk array 



11 



controllers 1-1 to 1-N. 

In a similar manner, when a failed disk drive unit has been hot-swapped with 
a disk drive unit off the shelf, the recovery of failed unit may be achieved without 
affecting any access from the host, as the spare disk drive unit controller 15 may 
5 establish one-to-one connection for the fibre channel fabric switch controllers 3-1 to 
3-M, switched from the spare disk drive units 4-a and 4-b to a healthy disk drive unit 
newly hot-swapped with a failed disk drive unit, to perform data copy/recovery 
independently of the access from the disk array controller 1-1 to 1-N to the disk drive 
group 10-1 to 10-L (see Fig. 4). 

10 The present invention provides the connectivity of the plural disk drive units to 

a disk drive unit interface without compromising the transfer performance by using 
the fibre channel interface, which is a scheme of serial interface, and by applying a 
fibre channel fabric topology, which allows hot swapping of connectivity. The present 
invention further provides a solution of controlling the plural disk drive units with one 

15 or a few disk drive unit controllers, by hot-swapping the connectivity for each 

controller and disk drive group. In addition, the present invention provides Improved 
reliability of the system by performing the operation of data recovery in case of disk 
drive unit failure, independently of the data transfer between the disk drive interface 
controllers and the disk drive units. 

20 The foregoing description of the preferred embodiment of the invention has 

been presented for purposes of illustration and description. It is not intended to be 
exhaustive or to limit the invention to the precise form disclosed, and modifications 
and variations are possible in light of the above teachings or may be acquired from 
practice of the invention. The embodiment chosen and described in order to explain 

25 the principles of the invention and its practical application to enable one skilled in the 
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art to utilize the invention in various embodiments and witli various modifications 
are suited to the particular use contemplated, it is intended that the scope of the 
invention be defined by the claims appended hereto, and their equivalents. 
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